THE CENTRAL CURVE IN LINEAR PROGRAMMING 
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Abstract. The central curve of a linear program is an algebraic curve specified by linear 
and quadratic constraints arising from complementary slackness. It is the union of the 
various central paths for minimizing or maximizing the cost function over any region in the 
associated hyperplane arrangement. We determine the degree, arithmetic genus and defining 
prime ideal of the central curve, thereby answering a question of Bayer and Lagarias. These 
invariants, along with the degree of the Gauss image of the curve, are expressed in terms of 
the matroid of the input matrix. Extending work of Dedieu, Malajovich and Shub, this yields 
an instance-specific bound on the total curvature of the central path, a quantity relevant 
for interior point methods. The global geometry of central curves is studied in detail. 



1. Introduction 

We consider the standard linear programming problem in its primal and dual formulation: 

(1) Maximize c^x subject to Ax. = b and x > 0; 

(2) Minimize b"^y subject to A^y — s = c and s > 0. 

Here A is a fixed matrix of rank d having n columns. The vectors c G M" and b G image (A) 
may vary. In most of our results we assume that b and c are generic. This implies that both 
the primal optimal solution and the dual optimal solution are unique. We also assume that 
the problem ([T]) is bounded and both problems Q and ^ are strictly feasible. 

Before describing our contributions, we review some basics from the theory of linear pro- 
gramming [26}[33]. The (primal) logarithmic barrier function for ([T]) is defined as 

n 

/a(x) := c'^x + A^logXi, 

where A > is a real parameter. This specifies a family of optimization problems: 

(3) Maximize /a(x) subject to Ax = h and x > 0. 

Since the function fx is strictly concave, it attains a unique maximum x*(A) in the interior 
of the feasible polytope P = {x G M>o : Ax = b}. Note that /a(x) tends to — oo when x 
approaches the boundary of P. The primal central path is the curve {x*(A) | A > 0} inside 
the polytope P. There is an analogous logarithmic barrier function for the dual problem ^ 
and a corresponding dual central path. The central path connects the optimal solution of 
the linear program in question with its analytic center. This is the optimal point of /oo or 
equivalently argmaxp(^"^-|^ logXj). The central path is homeomorphic to a line segment. 
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Figure 1. The central curve of six lines for two choices of the cost function 



The complementary slackness condition says that the pair of optimal solutions, to the 
primal linear program Q and to the dual linear program ([2]), are characterized by 

(4) Ax = h , A^y — s = c, x>0, s>0, and XjSj = for i = 1,2, . . . ,n. 

The central path converges to the solution of this system of equations and inequalities: 

Theorem 1 (cf. [33]). If A has d rows, then for all A > 0, the system of polynomial equations 



(5) 



/lx = b, A^y 



and XiSi = X for i 



1,2, 



n, 



has a unique real solution (x*(A), y*(A), s*(A)) with the properties x*(A) > and s*(A) > 0. 
The point x*(A) is the optimal solution of The limit point (x*(0), y*(0), s*(0)) of these 
solutions for A — ?■ zs the unique solution of the complementary slackness constraints Q. 

Our object of study in this paper is the set of all solutions of the equations ([s]), not just 
those whose coordinates are real and positive. For general b and c, this set is the following 
irreducible algebraic curve. The central curve is the Zariski closure of the central path in 
(x, y, s)-space, that is, it is the smallest algebraic variety in M^""'"'^ that contains the central 
path. The primal central curve in is obtained by projecting the central curve into x-space. 
We can similarly define the dual central curve by projecting into y-space or into s-space. 

Example 2. Figure [T] depicts the primal central curve for a small transportation problem. 
Here A is the 5x6 node-edge matrix of the complete bipartite graph K23, as shown below: 
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Here n = 6 and d = A because A has rank 4. We return to this example in Section 4. o 



As seen in Figure [T| and proved in Theorem 33 , the primal central curve contains the 



central paths of every polytope in the arrangement in {Ax. = b} defined by the coordinate 
hyperplanes {xj = 0} for the cost functions c and — c. The union over all central curves, as 
the right hand side b varies, is an algebraic variety of dimension c?+l, called the central sheet, 



which will play an important role. Our analysis will be based on results of Terao 31 and 



Proudfoot-Speyer 24 on algebras generated by reciprocals of linear forms; see also Berget [6] . 
Matroid theory will be our language for working with these algebras and their ideals. 

The algebro-geometric study of central paths was pioneered by Bayer and Lagarias [3|[4]. 
Their 1989 articles are part of the early history of interior point methods. They observed 
(on pages 569-571 of j4j) that the central path defines an irreducible algebraic curve in x- 
space or y-space, and they identified a complete intersection that has the central curve as an 
irreducible component. The last sentence of |4| §11] states the open problem of identifying 
polynomials that cut out the central curve, without any extraneous components. It is worth 
stressing that one easily finds polynomials that vanish on the central curve from the gradient 
optimality conditions on the barrier function. Those polynomials vanish on high-dimensional 
components, other than the central curve. These extra components are contained in the 
coordinate hyperplanes, and the challenge is to remove them in our algebraic description. 

In numerical optimization, the optimal solution to ([T]) is found by following a piecewise- 
linear approximation to the central path. Different strategies for generating the step-by-step 
moves correspond to different interior point methods. One way to estimate the number of 
Newton steps needed to reach the optimal solution is to bound the total curvature of the 
central path. This has been investigated by many authors (see e.g. |ll]|22}|28||34|[37]), the 
idea being that curves with small curvature are easier to approximate with line segments. 
The algebraic results in this paper contribute to the understanding of the total curvature. 

Here is a list of our results. Precise statements are given in each section. 

• Section [2] analyzes central curves in the plane, with emphasis on the dual formulation 



{d = 2). We show that our curves are Vinnikov curves 35 of degree < n — 1 



obtained from an arrangement of n lines by taking a Renegar derivative 25 . The total 



curvature of a plane curve can be bounded in terms of its number of real inflection 



points. We derive a new bound from a classical formula due to Felix Klein 18 . 
All our formulas and bounds in Sections |4} |5| [6| and [7] are expressed in the language 
of matroid theory. A particularly important role is played by matroid invariants, 
such as the Tutte polynomial, that are associated with the matrix A. In Section [3] we 
review the required background from matroid theory and geometric combinatorics. 
In Section |4] we present a complete solution to the Bayer-Lagarias problem. Under 
the assumption that b and c are general, while A is fixed and possibly special, we 
determine the prime ideal of all polynomials that vanish on the primal central curve. 
We express the degree of this curve as a matroid invariant. This yields the tight 
upper bound ("^^) for the degree. For instance, the curves in Figure [l have degree 
five. We also determine the Hilbert series and arithmetic genus of our curve in P". 
Section [5] develops our approach to estimating the total curvature of the central 



curve. Dedieu, Malajovich and Shub 11 noted that the total curvature of any curve 
C coincides with the arc length of the image of C under the Gauss map. Hence 
any bound on the degree of the Gauss curve translates into a bound on the total 
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curvature. Our main result in Section [5] is a very precise bound, in terms of matroid 
invariants, for the degree of the Gauss curve arising from any hnear program. 

• While Sections |4] and |5] focused on primal linear programs. Section [6] revisits our 
results on the degree and curvature, and it translates them to the dual formulation. 
Theorem |27] characterizes the average curvature over the bounded feasibility regions. 

• Section [7] furnishes an entirely symmetric description of the primal-dual central curve 
inside a product of two projective spaces. This leads to a range of results on the global 
geometry of our curves. In particular, we explain how the central curve passes through 
all vertices of the hyperplane arrangement and through all the analytic centers. 

What got us started on this project was our desire to understand the "snakes" of Deza, 
Terlaky and Zinchenko We close the introduction by presenting their curve for n = 6. 

Example 3. Let n = 6, d = 2 and fix the following matrix, right hand side and cost vector: 
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The resulting linear program, in its dual formulation ([2]), is precisely the instance in 13, Fig- 



ure 2, page 218]. We redrew the central curve in Figure |2| The hexagon Pg2 shown there 
equals {y G : A^y > c}. The analytic center of Pg 2 ^ point with approximate coordi- 
nates y = (—0.027978..., 0.778637...). It has algebraic degree 10 over Q, which indicates the 
level of difficulty to write exact coordinates. The optimal solution is the vertex with rational 
coordinates y = (2/1,2/2) = (-flS, -gliio) = (-0.033304..., -0.00086...). 

Following [13], we sampled many points along the central path, and we found that the 
total curvature of the central path equals 13.375481417... . This measurement concerns 
only the part of the central curve that goes from the analytic center to the optimum. Our 



algebraic recipe (19) for computing the central curve leads to the following polynomial: 

(1/2 - 1) (2760518880000000000000000 y| + 22783991895360000000000000 yfyi - 1559398946696532000000000 j/| + 
1688399343321073200000000^/1 yf+87717009913470910818000s/|-3511691013758400000000000yJj/|-324621326759441931317j/2 
+ 11183216292449806548000 J/1J/2 + 2558474824415400000000 y^yj _ 5135843I8OI6OOOOOOOOOOO yfy2 + 6337035495096700140 yi 
+ 776239200000000000001/4 - 13856351760343620000 + 291589604847546655 - 38575873512000000000 j/f). 

This polynomial of degree five has a linear factor ?/2 — 1 because the vector b that specifies 
the objective function in this dual formulation is parallel to the first column of A. Thus 
the central curve in Figure [2] has degree four, and its defining irreducible polynomial is the 
second factor. When the cost vector b is replaced by a vector that is not parallel to a column 
of A then the output of the same calculation (to be explained in Section 4) is an irreducible 
polynomial of degree five. In other words, for almost all b, the central curve is a quintic. 

While most studies in optimization focus only on just the small portion of the curve that 
runs from the analytic center to the optimum, we argue here that the algebraic geometry of 
the entire curve reveals a more complete and interesting picture. For generic b and c, the 
central curve is a quintic that passes through all vertices of the line arrangement defined by 
the six edges of the polygon. As we shall see, it passes through the analytic centers of all 
bounded cells (Theorem 33) and it is topologically a nested set of ovals (Proposition |4]). o 



THE CENTRAL CURVE IN LINEAR PROGRAMMING 5 




Figure 2. The DTZ snake with 6 constraints. On the left, a global view of 
the polygon and its central curve with the line 2/2 = 1 appearing as part of the 
curve. On the right a close-up of the central path and its inflection points. 



2. Plane Curves 

When the central curve lives in a plane, the curve is cut out by a single polynomial 
equation. This occurs for the dual curve when d = 2 and the primal curve when n = d — 2. 
We now focus on the dual curve {d = 2). This serves as a warm-up to the full derivation of 
all equations in Section |4j In this section we derive the equations of the central curve from 
first principles, we show that these curves are hyperbolic and Renegar derivatives of products 
of lines, and we use this structure to bound the average total curvature of the curve. 

Let A = {ttij) be a fixed 2 x n matrix of rank 2, and consider arbitrary vectors b = 
{bi, ^2)^ £ and c = (ci, . . . , c„)"^ G M". Here the y-space is the plane with coordinates 
y = {yi, 2/2)- The central curve is the Zariski closure in this plane of the parametrized path 

n 

y*(A) = argmin biyi + 622/2 - 52 logl*^!*?/! + '^2*2/2 - q). 

{y : ATy>c} ^ 

The conditions for optimality are obtained by setting the first partial derivatives to zero: 

n n 

= b,-xy ^ and = b2-Xy ^ . 

^ auVi + a2i2/2 - Ci ^ aiiVi + 02^2/2 - Cj 

Multiplying these equations by 62 /A or 61 /A gives 

/„N 6162 b2aii sr-^ bia2i 



auVi + ^2*2/2 - Ci ^ auyi + 02^2/2 - Q 



This eliminates the parameter A and we are left with the equation on the right. By clearing 
denominators, we get a single polynomial C that vanishes on the central curve in y-space: 



(7) 



C(y) = y^{bia'2i - b2aii) JJ (aij2/i + a2j2/2 - 9), 
iex iex\{i} 
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Figure 3. The degree-6 central path of a planar 7-gon in the affine charts 
{Uo = 1} and {y2 = 1}. Every line passing through [0 : —62 : &i] intersects the 
curve in 6 real points, showing the real curve to be 3 completely-nested ovals. 



where X = {z : 6ia2i — &2«ij 7^ 0}. We see that the degree of C(y) is |X| — 1. This equals n — 1 
for generic b. In our derivation we assumed that A is non-zero but the resulting equation is 
valid on the Zariski closure, which includes the important points with parameter A = 0. 

We consider the closure C of the central curve in the complex projective plane with 
coordinates [i/q : yi : 1/2] • Thus C is the complex projective curve defined by y|f ^C{^, 

Proposition 4. The curve C is hyperbolic with respect to the point [0 : —62 '■ bi]. This 
means that every line in P^(]R) passing through this special point meets C only in real points. 

Proof. Any line passing through the point [0 : —62 '■ &i] (except the line {yo = 0}) has the 
form {biyi + 62^/2 = ^0^0} ioi some Bq E M. See the left picture in Figure [3j We shall see in 
Remark 36 that, for any bo G M, the line meets C in deg(C) real points. 



□ 



Hyperbolic curves are also known as 
work 



light of Vinnikov's seminal 
Semidefinite programming 
has been generalized to hyperbolic programming, in the work of Renegar 25 and others. A 



20,35 



Vinnikov curves, m 
relating them to semidefinite programming [27] 



key construction in hyperbolic programming is the Renegar derivative which creates a (hy- 
perbolic) polynomial of degree D — 1 from any (hyperbolic) polynomial of degree D. To be 
precise, the Renegar derivative of a homogeneous polynomial / with respect to a point e is 

^e/(y) = (if{y + te) 



dr 



t=0 



Renegar derivatives correspond to the polar curves of classical algebraic geometry 14, §1.1]. 
The Renegar derivative of / = Yliexi'^^iy^ + (i2iy2 — Ql/o) with e = (0, —62, &i) is seen to be 

(8) ^e/(y) = '^{bia2i - b2aii) Yl i^ijyi + a2jy2 - Cjyo) = C(y). 

iei jex\{i} 

In words: the central curve C is the Renegar derivative, taken with respect to the cost 
function, of the product of the linear forms that define the convex polygon of feasible points. 
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The product of linear forms / = Hjl'^ij?/! + '^2il/2 — Ql/o) is a hyperbolic polynomial with 
respect to e. Renegar [25] shows that if / is hyperbolic with respect to e then so is -Re/- 
This yields a second proof for Proposition |4} 

Proposition |4] is visualized in Figure |3j The picture on the right is obtained from the 
picture on the left by a projective transformation. The point at infinity which represents the 
cost function is now in the center of the diagram. In this rendition, the central curve consists 
of three nested ovals around that point, highlighting the salient features of a Vinnikov curve. 
This beautiful geometry is found not just in the dual picture but also in the primal picture: 

Remark 5. If d = n — 2 then the primal central curve lies in the plane {Ax = b}. The 
conditions for optimality of state that the vector V(^^loga;j) = (x^^, . . . ,x~^) is in the 
span of c and the rows of A. The Zariski closure of such vectors is the central sheet, to be 
seen in Section 4. Here, the central sheet is the hypersurface in M" with defining polynomial 





(9) 

where Ai is the ith column of A and X = {i : {{^^.)}je[n]\i are linearly independent}. We 
see that the degree of this hypersurface is |X| — 1, so it is n — 1 for generic A. Intersecting this 
surface with the plane {Ax = b} gives the primal central curve, which is hence a curve of 
degree |X| — 1. The corresponding complex projective curve in P2 = { [xo : x] I Ax = xoh} C 
P" is hyperbolic with respect to the point [0 : v] in P", where v spans the kernel of (^). o 

It is of importance for interior point algorithms to know the exact total curvature, formally 



introduced in equation (24), of the central path of a linear program (see |ll||22||28|[34|[37] ). 
Deza et al. fls] proved that even for d = 2 the total curvature grows linearly in n, and 
they conjectured that the total curvature is no more than 2?™. They named this conjecture 
the continuous Hirsch conjecture because of its similarity with the discrete simplex method 
analogue (see [l2]). In Section |5] we derive general bounds for total curvature, but for plane 
curves we can exploit an additional geometric feature, namely, inflection points. 

Benedetti and Dedo |5| derived a general bound for the total curvature of a real plane 
curve in terms of its number of inflection points and its degree. We can make this very 
explicit for our central path {y*(A) : A G M>o}. Its total curvature is bounded above by 

(10) total curvature of the central path < tt ■ (its number of inflection points + 1). 

To see this, consider the Gauss map 7 that takes the curve into the circle by mapping 
any point on the curve to its unit tangent vector. See Section |5] for the general definition. 
The total curvature is the arc length of the image of the Gauss map. As A decreases from 
00 to 0, the cost function b-^y*(A) strictly decreases. This implies that, for any point y*(A) 
on the curve, its image under the Gauss map has positive inner product with b, that is, 
b"^7(y*(A)) > 0. Thus the image of the Gauss map is restricted to a half circle of S^, and it 
cannot wrap around S^. This shows that the Gauss map can achieve a length of at most vr 
before it must "change direction", which happens only at inflection points of the curve. 

It is known that the total number of (complex) inflection points of a plane curve of degree 
D is at most 3D{D — 2). For real inflection points, there is an even better bound: 

Proposition 6 (A classical result of Felix Klein [18j). 

The number of real inflection points of a plane curve of degree D is at most D{D — 2). 
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This provides only a quadratic bound for the total curvature of the central path in terms of 
its degree, but it does allow us to improve known bounds for the average total curvature. The 
average total curvature of the central curve of a hyperplane arrangement is the average, over 
all bounded regions of the arrangement, of the total curvature of the central curve in that 
region. Dedieu et al. [ll] proved that the average total curvature in a simple arrangement 
{i.e. for a generic matrix A) defined by n hyperplanes in dimension d is not greater than 
211(1. When d = 2, we can use Proposition |6] to improve this bound by a factor of two. See 



Theorem 27 for the extension to general d. 



Theorem 7. The average total curvature of a central path of a generic line arrangement in 
the plane is at most 2tt. 

Proof. The central curve for n general lines in M? has degree n — 1 and consists of n — 1 (real 
affine) connected components. The argument above and Klein's theorem then show that 

71—1 n—1 

^^(curvature of the zth component) < tt (^inflection points on the ith component + 1) 

i=l i=l 

< 7r(n- l)(n-2). 

Our arrangement of n general lines has ("2 ^) bounded regions. The average total curvature 
over each of these regions is therefore at most ^{n — l){n — 2) / ("2^) = 27r. □ 

To bound the curvature of just the central path, we need to bound the number of inflection 
points appearing on that piece of the central curve. To address this issue, we posed the 
following problem in the manuscript version of this article: What is the largest number 
of inflection points on a single oval of a hyperbolic curve of degree D in the real plane? In 
particular, is this number linear in the degree D ? These questions have since been answered 
by Erwan Brugalle and Lucia Lopez de Medrano, using an extension of their techniques in [o] . 
They constructed a Vinnikov curve of even degree D which has the maximal number D{D—2) 
of inflection points and all of these inflection points lie on the outermost oval. It would be 
very interesting to see whether their approach can be applied to improve the DTZ snakes of 
Example [3] and lead to new lower bounds for the total curvature of planar central paths. 

3. Concepts from Matroid Theory 

We have seen in the previous section that the geometry of a central curve in the plane is 
intimately connected to that of the underlying arrangement of constraint lines. For instance, 
the degree of the central curve, |X| — 1, is one less than the number of constraints not parallel 
to the cost function. The systematic study of this kind of combinatorial information, encoded 
in a geometric configuration of vectors or hyperplanes, is the subject of matroid theory. 

Matroid theory will be crucial for stating and proving our results in the rest of this paper. 
This section offers an exposition of the relevant concepts. Of course, there is already a well- 
established connection between matroid theory and linear optimization (e.g., as outlined in 
[19j or in oriented matroid programming [2]). Our paper sets up yet another connection. The 
material that follows is well-known in algebraic combinatorics, but less so in optimization, 
so we aim to cover the basic facts. The missing details can be found in [7, 10 



We consider an r-dimensional linear subspace C of the vector space K"^ with its fixed 
standard basis. Here K is any field. Typically, £ will be given to us as the row space of an 
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r X n-matrix. The kernel of that matrix is denoted by C^. This is a subspace of dimension 
n — r in K"^. We write Xi, . . . , x„ for the restriction of the standard coordinates on K"- to C 
The two subspaces £ and specify a dual pair of matroids, denoted M(£) and M{C^), 
on the set [n] = {1, . . . , n}. The matroid M{C) has rank r and its dual M{C-^) = M{C)* has 
rank n — r. We now define the first matroid M = M{C) by way of its independent sets. A 
subset / of [n] is independent in M if the linear forms in {xi : z G /} are linearly independent 
on C. Maximal independent sets are called bases. These all have cardinality r. A subset / 
is dependent if it is not independent. It is a circuit if it is minimally dependent. 

Example 8. Consider the linear space C spanned by the rows of the rank 4 matrix A 
in Example [2] Because the first four columns of A are linearly independent, the linear 
forms {xi,X2,X3,Xi} are linearly independent on C and {1,2,3,4} is an independent set of 
M{C). As C has dimension four, {1,2,3,4} is a basis of the matroid M{C). On the other 
hand, the set of columns {1,2,4,5} is linearly dependent but every proper subset is linearly 
independent. Hence {1,2,4,5} is a circuit of M(£). From similar considerations, we find 
that M(£) has nine circuits, namely {i,j,k,l} where i,j G {1,2,3} and k,l G {4,5,6}. o 

One matroid application of importance for our study of central curves is the following 
formula for number of bounded components of a hyperplane arrangement. Let u be a 
generic vector in and consider the (n — r)-dimensional affine space £^ + u of M". The 
equations Xi = define n hyperplanes in this affine space. The arrangement {xi = 0}i£[n] in 
+ u is simple, which means that no point lies on more than n — r of the n hyperplanes. 
The vertices of this hyperplane arrangement are in bijection with the bases of the matroid 
M. The complements of the hyperplanes are convex polyhedra; they are the regions of the 
arrangement. Each region is either bounded or unbounded, and we are interested in the 
bounded regions. These bounded regions are the feasibility regions for the linear programs 



with various sign restrictions on the variables Xj. Proposition 6.6.2 in 10 , which is based 
on results of Zaslavsky [36], equates the number of such regions with the absolute value of 
the Mohius invariant /i(M) of the matroid of M: 

(11) |/i(M)| = # bounded regions of the hyperplane arrangement {xi = 0}jg[n] in £-'-+u. 

Further below, in Equation (13), this invariant will be expressed in terms of the matroid M. 
We refer to |/i(M)| as the Mohius number of the matroid M. 

To obtain the Mobius number and more refined invariants that we will need in Section |5] 
we introduce a simplicial complex associated to the matroid M called the broken circuit 
complex. We fix the standard ordering 1 < 2 < ■ ■ ■ < n oi [n]. A broken circuit of M is any 
subset of [n] of the form C\{min(C)} where C is a circuit. The broken circuit complex of M 
is the simplicial complex Br(M) whose minimal non-faces are the broken circuits. Hence, a 
subset of [n] is a face of Br(M) if it does not contain any broken circuit. It is known that 
Br(M) is a shellable simplicial complex of dimension r — 1 (see Theorem 7.4.3 in [?]). We 
can recover the Mobius number of M as follows. Let fi = /j(Br(M)) denote the number 
of i-dimensional faces of the broken circuit complex Br(M). The corresponding h- vector 
{ho, hi, ... , hr-i) can be read off from any shelling (cf. fh §7.2] and l29l §2]). It satisfies 



^ fi-iz' _ hp + hiz + h2z'^ H hK-iz"" 



The relation between the f-vector and the h-vector holds for any simplicial complex 29 



The rational function (12) is the Hilbert series (see 129]) of the Stanley-Reisner ring of the 
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broken circuit complex Br(M). The defining ideal of the Stanley- Reisner ring is generated 
by the monomials niec\{mm(c)} -^^ representing broken circuits. Proudfoot and Speyer f2^ 
constructed a broken circuit ring, which is the quotient of [xi, . . . , x„] modulo a prime ideal 



whose initial ideal is precisely this monomial ideal. Hence (12) is also the Hilbert series of 



the ring in 24 . In particular, the Mobius number is the common degree of both rings: 



(13) HM)\ = ho + hi + h2 + --- + hr-i. 

Example 9 (Uniform matroids). If £ is a general r-dimensional subspace of K"- then M = 
M{C) is the uniform matroid M = Ur^n, whose bases are all r-subsets in [n] and whose 
circuits are all (r + l)-subsets of [n]. The broken circuits of M are then all the r-subsets of 
{2, . . . , n}. The broken circuit complex Br(M) is the (r — l)-dimensional simplicial complex 
on [n] whose maximal simplices are {1, ji, . . . , jV-i} where 2 < ji < ■ ■ ■ < jV-i < n. This 
shows that fi equals ("7^) for 1 < 2 < r - 1. Using one finds that hi = (""7'"^)- We 



can then use (13) to compute the Mobius number of the uniform matroid M = Ur, 

, , , , /n — r + i — l\ (n — X\ 

(14) IMf/..)l = E( , ) ^ \r-l\ 



This binomial coefficient is an upper bound on |/x(M)| for any rank r matroid M on \n\. 

To understand the geometric interpretation of |/i(M)|, let us identify with ]R"~^. Here 
we are given n general hyperplanes through the origin in M""*", and we replace each of them 
by a random parallel translate. The resulting arrangement of n affine hyperplanes in 



creates precisely bounded regions, as promised by the conjunction of (14) and (11). 



o 



The Mobius number is important to us because it computes the degree of the central 
curve of the primal linear program ([T]). See Theorem 13 below. We will take r = c? + 1 and 



L = Ca,c to be the linear space spanned by the rows of A and the vector c. The matroid 
M{Ca,c) we need there has rank r = d + 1 and it is denoted Ma,c- We use the notation 

(15) |MAc)| := HMa,c)\ = HMiCA,c))\- 

Consider Example [2| with A the displayed 5x6-matrix of rank c? = 4, or the instance in 
Figure |3} Here, n = 6,r = d+ l = 5, and c = ^5.6 is the uniform matroid. Its Mobius 
number equals |/i(y4, c)| = \fi{U5fi)\ = 5. This number 5 counts the bounded segments on 
the vertical line on the left in Figure [3j Note that the relevant matroid for Example [2] is 
not, as one might expect, the graphic matroid of -^'2,3- For higher-dimensional problems the 
matroids Ma,c we encounter are typically non-uniform. 

There are many other interpretations of the Mobius invariant and the h-vector. For 
example, a useful identity for computations is the h-vector as an evaluation of the Tutte 
polynomial TM{x,y) of the matroid M (see [7| Eq. (7.15)] and the discussion in [7| §7.2]): 

(16) ho + hiz + h2Z^ + ■ ■ ■ + K-iz'-^ = ^"■Tm(1A,0). 



4. Equations defining the central curve 

In this section we determine the prime ideal of the central curve of the primal linear 
program ([T|. As a consequence we obtain explicit formulas for the degree, arithmetic genus 
and Hilbert function of the projective closure of the primal central curve. These results 
resolve the problem stated by Bayer and Lagarias at the end of |4l §11]. 



THE CENTRAL CURVE IN LINEAR PROGRAMMING 



11 



Our ground field is K will be Q(A)(b, c). Here Q(A) denotes the subfield of M generated 
by the entries of A and Q(A)(b, c) is the rational function field generated by the coordinates 
hi and Cj of the right hand side b and the cost vector c. We assume that these coordinates 
are algebraically independent over Q(A). This is a formal way of ensuring that our algebraic 
results remain valid for generic values of hi and Cj in M. 

Let Ca,c be the subspace of spanned by the rows of A and the vector c. We define 
the central sheet to be the coordinate-wise reciprocal Cj^^ of that linear subspace. In precise 
terms, we define to be the Zariski closure in the affine space C" of the set 



(17) < — G C" : M2, • • • , Un) e Ca,c and 7^ for i = 1 

Ui U2 u 



Lemma 10. The Zariski closure of the primal central path {x*(A) : A G ]R>o} is equal to 
the intersection of the central sheet C~^^ with the affine-linear subspace defined by Ax = h. 

Proof. We eliminate s, y and A from the equations A'^y — s = c and XjSj = A as follows. We 
first replace the coordinates of s by Sj = A/xj. The linear system becomes A^y — Ax~^ = c. 
This condition means that x~^ = . . . , ^)"^ lies in the linear space Ca,c spanned by c 
and the rows of A. The result of the elimination says that x lies in the central sheet C^\- 
For X in the Zariski-dense set fl (C*)", one can reconstruct values of A,y,s for which 
(x, y, s, A) is a solution to the equations A'^y — s = c, XiSi = A. This shows that is 
indeed the projection of the set of these solutions onto the x-coordinates. □ 

The linear space {Ax = b} has dimension n — d, and we write lA,h for its linear ideal. 
The central sheet C'^^^ is an irreducible variety of dimension d+1, and we write Ja,c for its 
prime ideal. Both /^ b and Ja,c are ideals in K[xi, . . . ,x„]. We argue the following is true: 

Lemma 11. The prime ideal of polynomials that vanish on the central curve C is lA,h + Ja,c- 
The degree of both C and the central sheet coincides with the Mobius number \ii{A,c)\. 

Proof. The intersection of the affine space {Ax = b} with the central sheet is the variety 
of the ideal lA,b + Ja,c- This ideal is prime because b and c are generic over Q(A). The 
intersection is the central curve. In Proposition 12 we show that the degree of the central 
sheet is c)|, so here it only remains to show that this is the degree of the central 

curve as well. For a generic vector (b, cq) G M'^"'"^, we consider the hyperplane arrangement 
induced by {xi = 0} in the affine space {(^)x = (^)}- The number of bounded regions of 
this hyperplane arrangement equals the Mobius number |/i(yl,c)|, as seen in (11). 

Each of these bounded regions contains a unique point maximizing Yli^'^S This point 
is the analytic center of that region. Each such analytic center lies in C'^\, and thus on the 
central curve by Lemma 10 This shows that the intersection of the central curve with the 
plane {c^x = cq} contains at least |/i(y4,c)| points. 

Bezout's Theorem implies that the degree of a variety F C C" is an upper bound for the 
degree of its intersection V r\H with an affine subspace H, provided that n + dim(\^ HH) = 
dim{V) + dim(if). We use this theorem for two inequalities; first, that the degree of 
bounds the degree of the central curve C, and, second, that the degree of C bounds the 
number of its intersection points with {c^x = Cq}. To summarize, we have shown: 

\KAc)\ < #(Cn{c^x = co}) < deg(C) < deg{C^]J = HA,c)\. 

From this we conclude that \fi{A, c)| is the degree of the primal central curve C. □ 
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At this point we are left with the problem of computing the degree of the homogeneous 
ideal Ja,c and a set of generators. Luckily, this has already been done for us in the literature. 



The following proposition was proved by Proudfoot and Speyer 24 and it refines an earlier 



result of Terao 31 . See also [6j for related results. The paper 30 begins the challenging 
task of extending these results from linear programming to semidefinite programming. 



Proposition 12 (Proudfoot-Speyer 



24 



The degree of the central sheet C 



1 



regarded as 

a variety in complex projective space, coincides with the Mohius number \^{A,c)\. Its prime 
ideal Ja,c is generated by a universal Grobner basis consisting of all homogeneous polynomials 

(18) ^ i;,- n 

iesupp(D) jesupp(i;)\{j} 

where 'Y^ViXi runs over non-zero linear forms of minimal support that vanish on Ca,c- 

Proof. The construction in [24] associates the ring if [xi, . . . , Ja,c to the linear subspace 
Ca,c of K^. Theorem 4 of 24 says that the homogeneous polynomials (18) form a universal 



Grobner bases for Ja,c- As argued in 24, Lemma 2], this means that the ring degenerates to 
the Stanley- Reisner ring of the broken circuit complex 'Bi{Ma,c)- Hence, by our discussion in 
Section 3, or by |24| Prop. 7], the Hilbert series of K[xi, . . . , x„]/ Ja,c is the rational function 
(12), and the degree of Ja,c equals |/i(A, c)| as seen in (13). The ideal J^c is radical, since 
its initial ideal is square-free, and hence it is prime because its variety is irreducible. □ 



The polynomials in (18) correspond to the circuits of the matroid Ma,c-, of which there 



are at most (^"2) • matrix A is generic, then Ma,c is the uniform matroid and every 

{d + 2)-subset of {1, ... , n} forms a circuit. In this case, by (14), its Mobius number equals 



\^l{A,c) 



n — 1 
d 



For arbitrary matrices A, this binomial coefficient furnishes an upper bound on the Mobius 
number |/i(A, c)|. We are now prepared to conclude with the main theorem of this section. 
The analogous equations for the dual central curve are given in Proposition 24 in Section [7j 



Theorem 13. The degree of the primal central curve of |ip is the Mobius number \j^{A^ c)| 
and is hence at most ■ The prime ideal of polynomials that vanish on the primal central 
path is generated by the circuit polynomials (18) and the d linear polynomials in Ax. — b. 



Proof. This is an immediate consequence of Lemmas 10 and 11 and Proposition 12 



□ 



It is convenient to write the circuit equations (18) in the following determinantal repre- 



sentation. Suppose that A has format dxn and its rows are linearly independent. Then the 
linear forms of minimal support that vanish on Ca,c are the (0? -|- 2) x {d + 2)-minors of the 

(d + 2) X n matrix I c ] . This gives the following concise description of our prime ideal Ja,c'- 



A 



(19) 



J, 



'num,ii+2 



-1 



-1 



X. 



and the operator /num,d+2 extracts the numerators of the {d+2) x 



where x 

((i-|-2)-minors of the matrix. For example, if the leftmost [d + l) x {d-\- 2) submatrix of the 
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matrix has full rank d + 1, then, as in (|9]), one generator of the ideal Ja c equals 

/ A2 ... Ad+2 

det Ci C2 
\x7^ x~ 



Cd+2 



n 

iex 



1 -^2 • • • -^(1+2/ 

where X is the unique circuit contained in {1,2, . . . ,d + 2}. Note that there are (^"2) ^^'^^ 
minors but they need not be distinct and some of them may be zero. 

Example 14. Let d = 4,n = 6 and A the matrix in Example [2] The linear ideal is 

lA,h = ( Xi + X2 + X3 - 61 , X4 + X5 + X6 - 62 , a;i + X4 - 63 , X2 + X5 - 64 ). 

The central sheet is the quintic hypersurface whose defining polynomial is 



(20) 



/a,c(x) 



det 



/ 1 


1 


1 








\ 











1 


1 


1 


1 








1 











1 








1 





Cl 


C2 


C3 


C4 








X2 


X3 




-5-^ 





XiX2X3X4^X5Xq. 



The primal central curve is the plane quintic defined by the ideal lA,b + (/a,c)- This ideal is 
prime for general choices of b and c. However, this may fail for special values: the quintic 
on the left in Figure [T] is irreducible but that on the right decomposes into a quartic and a 
line. For a concrete numerical example we set 61 = 62 = 3 and 63 = 64 = 65 = 2. Then the 
transportation polygon P is the regular hexagon depicted in Figure [1} Its vertices are 

. . ,'0 1 2\ fo 2 l\ fl 2\ fl 2 0\ f2 l\ ^2 1 0^ 

[Zi) 



2 1 



2 1 



1 2 



10 2 



2 1 



1 2 



and c' 



Consider the two transportation problems ( 1 ) given by c = 

In both cases, the last matrix in (21 ) is the unique optimal solution. Modulo the linear ideal 
-^A,b we can write the quintics /a,c and fA,c' as polynomials in only two variables Xi and X2'. 

3xfx2 + 5x1x2 — 2x1X2 — 3xf — 22x\x2 — 15xf + 8x1X2 + 2x| 
+18xf + 45x?X2 - 12x1 - 33x? - 22xiX2 + 22x2 ^ ^g^^ _ ^2^2, 

_ (x2 — 1) ■ {2x\ + 4xf X2 + x\x\ — x\x\ — \2x\ — 14x1x2 + xix| 
~ +xi + 22x? + 10x1X2 -5x^-12x1 + 6x2). 

Both quintics pass through all intersection points of the arrangement of six lines. The cost 
matrix c exemplifies the generic behavior, when the quintic curve is irreducible. On the 
other hand, the central path for c' is a segment on the horizontal line X2 = 1 in Figure [ij o 

Remark 15. When b or c is not generic, various aspects of the above analysis break down. 
If b is not generic, then the hyperplane arrangement {xj = 0}ig[.„] C {Ax = b} may not be 
simple, that is, it may have a vertex at which more than n — d hyperplanes meet. This vertex 
will maximize c^x over more than one adjoining region of the arrangement. In particular, 
the central curve passes through this vertex more than once and is singular at this point. 

If the cost function c is maximized at a (non-vertex) face of a region of the hyperplane 
arrangement {xj = 0}jg[„] C {Ax = b}, then the central curve meets this face in its analytic 
center and does not pass through any of the vertices of the hyperplane arrangement contained 
in the affine span of this face. For example, see Figure [2j Another potential problem is that 
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for non-generic c the curve defined by the equations of Theorem 13 may be reducible, as 
happens for the cost vector c' in Example [l4j The central curve will then be whatever 
component of these solutions passes through the region of interest. In particular, its degree 
and equations are no longer independent of the sign conditions on x. Fortunately, the Mobius 
number \fi{A, c)| is always an upper bound for the degree of the central curve. o 

In the remainder of this section we consider the question of what happens to the central 
sheet, and hence to the central path, when the cost function c degenerates to one of the 
unit vectors e^. Geometrically this means that the cost vector becomes normal to one of the 
constraint hyperplanes, and the curve reflects this by breaking into irreducible components. 
In addition to the nice geometry, our interest in these degenerations comes from the obser- 
vation that they seem to produce curves with high curvature. What follows is independent 
of the rest of the paper and can be skipped upon first reading. 

To set up our degeneration in proper algebraic terms, we work over the field of 
Puiseux series over the field K = Q{A)(h,c) that was used above. The field comes 
with a natural t-adic valuation. Passing to the special fiber represents the process of letting 
the parameter t tend to 0. Our cost vector c has its coordinates in the Puiseux series field: 

(22) c = 

Here wi > W2 > ■ ■ ■ > Wn-i > are any rational numbers. We are interested in the special 
fiber of the central sheet C^\. This represents the limit of the central sheet as t approaches 
0. This induces a degeneration of the central curve C'2\ H {Ax = b}. We wish to see how, 
in that limit, the central curve breaks into irreducible curves in the affine space {Ax = b}. 
The ideal defining the special fiber of Ja,c is denoted in(JA,c) = JA,c\t=o- By a combi- 



natorial argument as in 24 , the maximal minors in (19) have the Grobner basis property 
for this degeneration. Hence we obtain the prime ideal of the flat family by simply dividing 
each such minor by a non-negative power of t. This observation implies the following result: 

Theorem 16. The central sheet degenerates into a reduced union of central sheets of 
smaller linear programming instances. More precisely, the ideal in(J^^c) is radical, and it 
has the following representation as an intersection of ideals that are prime when A is generic: 

(23) m{JA,c) = Pi f /num,d+l f^-l ... ^\] + Xi+3, ■ ■ ■ , Xn)j 

Proof sketch. The Grobner basis property says that in( Ja,c) is generated by the polynomials 



obtained from the maximal minors of ( 19 ) by dividing by powers of t and then setting t to 



zero. The resulting polynomials factor, and this factorization shows that they lie in each of 



the ideals on the right hand side of (23). Conversely, each element in the product of the 
ideals on the right hand side is seen to lie in in(J^ c)- To complete the proof, it then suffices 
to note that in(J^ c) is radical because its generators form a square-free Grobner basis. □ 

Example 17. Let n = 6 and c? = 3. The matrix A might represent the three-dimensional 



Klee-Minty cube. The decomposition of the initial ideal in (23) has three components: 

■ I T \ I \ lA J ^1^1 ^2^2 XzA-i x^A^ , ( A^ A^ A-^ A^ A^ 

in(JA,c) = (^5, a;6) n (det i i i i , ) n /num,4 -i ^-i ^-i ^-i ^-i • 

\ -L -L X LI \Aj -j^ ^2 '^3 5/ 

For general A, the ideal J a,c defines an irreducible curve of degree 10, namely the central 
path, in each of the 3-planes {Ax = b}. The three curves in its degeneration above are 
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irreducible of degrees 1, 3 and 6 respectively. The first is one of the lines in the arrangement 
of six facet planes, the second curve is the central path inside the facet defined by = 0, 
and the last curve is the central path of the polytope obtained by removing that facet. o 

In general, we can visualize the degenerated central path in the following geometric fashion. 
We first flow from the analytic center of the polytope to the analytic center of its last facet. 
Then we iterate and flow from the analytic center of the facet to the analytic center of its 
last facet, which is a ridge of the original polytope. Then we continue inside that ridge, etc. 



5. The Gauss Curve of the Central Path 
The total curvature of the central path is an important quantity for the estimation of 



the running time of interior point methods in linear programming 11,22,28,34,37 . In this 
section we relate the algebraic framework developed so far to the problem of bounding the 
total curvature. The relevant geometry was pioneered by Dedieu, Malajovich and Shub [IT] . 
Following their approach, we consider the Gauss curve associated with the primal central 
path. The Gauss curve is the image of the central curve under the Gauss map, and its arc 
length is precisely the total curvature of the central path. Moreover, the arc length of the 
Gauss curve can be bounded in terms of its degree. An estimate of that degree, via the 



multihomogeneous Bezout Theorem, was the workhorse in 11 . Our main result here is a 
more precise bound, in terms of matroid invariants, for the degree of the Gauss curve of the 
primal central curve. As a corollary we obtain a new upper bound on its total curvature. 

We begin our investigation by reviewing definitions from elementary differential geometry. 
Consider an arbitrary curve [a, 6] — )■ M*^, t i— )■ /(t), whose parameterization is twice differ- 
entiable and whose derivative f'{t) is a non-zero vector for all parameter values t G [a, 6]. 
This curve has an associated Gauss map into the unit sphere S"*""^, which is defined as 



™-i . , , fit) 



7 : [a, h] -> 5™-^ , t ^ 



wmw' 

The image 7 = 7([a, b]) of the Gauss map in S"^~^ is called the Gauss curve of the given curve 
/. In our situation, the curve / is algebraic, with known defining polynomial equations, and 
it makes sense to consider the projective Gauss curve in complex projective space P"^^^. By 
this we mean the Zariski closure of the image of the Gauss curve under the double-cover map 
gm-i _j, pm-i_ If = 2, SO that C is a non-linear plane curve, then the Gauss curve traces 
out several arcs on the unit curve S^, and the projective Gauss curve is the entire projective 



line P^. Here, the line P^ comes with a natural multiplicity, to be derived in Example 22 

If m = 3 then the Gauss curve lies on the unit sphere and the projective Gauss curve 
lives in the projective plane P^. Since a curve in 3-space typically has parallel tangent lines, 
the Gauss curve is here expected to have singularities, even if / is a smooth curve. 

The total curvature K of our curve / is defined to be the arc length of its associated Gauss 



curve 7; see 11, §3]. This quantity admits the following expression as an integral: 

The degree of the Gauss curve 7(t) is defined as the maximum number of intersection 
points, counting multiplicities, with any hyperplane in M™, or equivalently, with any equa- 
tor in 5™"^. This (geometric) degree is bounded above by the (algebraic) degree of the 
projective Gauss curve in P™~^. The latter can be computed exactly, from any polynomial 
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representation of C, using standard methods of computer algebra. Throughout this section, 
by degree we mean the degree of the image of 7 in P™-~i multiphed by the degree of the map 
that takes C onto 7(C). From now on we use the notation deg(7(C)) for that number. 



Proposition 18. [11} Corollary 4.3] The total curvature of any real algebraic curve C in 
is bounded above by tt times the degree of its projective Gauss curve in P*""^. In symbols, 

K < TT • deg(7(C)). 

Remark 19. In higher dimensions, the degree of the map from C onto 7(C) is typically equal 
to 1, in which case our definition of deg(7(C)) is exactly that used in |TT]. However, that the 
extra factor is needed can be seen by considering the case m = 2 of non-linear plane curves: 
the Gauss curve 7(C) is just P^, but it has a non-reduced structure coming from the map. o 

We now present our main result in this section, which concerns the degree of the projective 
Gauss curve 7(C), when C is the central curve of a linear program in primal formulation. As 
before, A is an arbitrary real matrix of rank d having n columns, but the cost vector c and 
the right hand side b are generic over Q(v4). The curve C lives in an (n — (i)-dimensional 
affine subspace of M", which we identify with M""'^, so that 7(C) is a curve in p^^'^^i. 

Let Ma,c denote the matroid of rank c? -|- 1 on the ground set [n] associated with the 
matrix (^). We write (/iq, hi, hd) for the h-vector of the broken circuit complex of Ma^c-, 
as defined in ( [l2| . In the generic case, Ma,c = Ud+i,n is the uniform matroid, as in Example [9] 
In this case, the coordinates of the h-vector are hi = ^ p^^^ special matrices A, this 

simplicial complex gets replaced by a pure shellable subcomplex of the same dimension, so 
the h-vector (weakly) decreases in each entry. Hence, the following always holds: 

(25) k < (''~'^^'~'^] for t = 0,l,...,d. 



As indicated, this inequality holds with equality when Ma,c is the uniform matroid. 
Theorem 20. The degree of the projective Gauss curve of the primal central curve C satisfies 



(26) deg(7(C)) < 2-J2^-h, 



=1 



In particular, we have the following upper bound which is tight for generic matrices A: 

— 1' 



(27) deg(7(C)) < 2 ■ {n - d - I] 



d-l 



The difference between the bound in (26 ) and the degree of 7(C) can be explained in terms 



of singularities the curve C may have on the hyperplane at infinity. The relevant algebraic 



geometry will be seen in the proof of Theorem 20, which we shall present after an example. 



Example 21. In the following two instances we have d = 3 and n = Q. 

(1) First assume that A is a generic 3 x 6-matrix. The arrangement of six facet planes 
creates 10 bounded regions. The primal central curve C has degree (^3^) = 10. It 
passes through the (3) = 20 vertices of the arrangements. In-between it visits the 10 
analytic centers of the bounded regions. Here the curve C is smooth and its genus 



is 11. This number is seen from the formula (29) below. The corresponding Gauss 



curve in P^ has degree 2 • 10 -|- 2 ■ genus(C) — 2 = 40, as given by the right hand side 



of (27). Hence the total curvature of the central curve C is bounded above by 407r. 
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(2) Next consider the Klee-Minty cube in 3-space. Normally, it is given by the constraints 

< zi < 1 , ezi < Z2 < I — ezi, and ez2 < z-s < 1 — ez2. 

To see this in a primal formulation ([T]), we use ^1,^2,^3 to parametrize the affine 
space {Ax = b}. The facets of the cube then correspond to the intersection of the 
coordinate hyperplanes with this affine space. This is given by the matrices 



A 



( 1 


1 











o\ 


2e 





1 


1 








2e2 





2e 





1 


1 




C2 


ca 


C4 


C5 





and b 



Here e is a small positive real constant. The above 4x6-matrix is not generic, and 
its associated matroid Ma,c is not uniform. It has exactly one non-basis, and so 
the h-vector equals {ho, hi, h2, h^) = (1,2,3,3). The central curve C has degree 
hi = 9. In the coordinates used above, the curve is defined by the 5x5- minors 
of the 5x6-matrix which is obtained from the 4x6-matrix (^) by adding one row 
consisting of reciprocal facet equations: 



, (1 - zi) , {z2 - ezi) , {1- Z2- ezi) , {z^ - ez2) , (1 



Z3 



According to Theorem 20, the degree of the Gauss curve 7(C) in P' 



- eZ2j 
is bounded 



above by 2 Yli=i i ' hi = 34. A computation using Macaulay2 |l5j reveals that 
degree(7(C)) = 32 and that the total curvature is bounded by 327r. o 



Proof of Theorem For the proof we shall use the generalized Pliicker formula for curves: 



(28) 



deg(7(C)) = 2 ■ deg(C) + 2 ■ genus(C) - 2 



The formula in (28 ) is obtained from 23 , Thm. (3.2)] by setting m = 1 or from 16 , Eq. (4.26)] 
by setting k = 0. The quantity k is a non-negative integer and it measures the singularities 
of the curve C. We have k = whenever the projective curve C is smooth, and this happens 
in our application when M4 c is the uniform matroid. In general, we may have singularities 
at infinity because here the real affine curve C has to be replaced by its closure in complex 
projective space P"-~'^, which is the projectivization of the affine space defined by Ax = b. 
The degree and genus on the right hand side of (28) refer to that projective curve in P"~'^. 



The references above actually give the degree of the tangent developable of the projective 
curve C, but we see that this equals the degree of the Gauss curve. The tangent developable 
is the surface obtained by taking the union of all tangent lines at points in C. The pro- 
jective Gauss curve 7(C) is obtained from the tangent developable by intersecting it with a 
hyperplane, namely, the hyperplane at infinity, representing the directions of lines. 

In the formula (28), the symbol genus(C) refers to the arithmetic genus of the curve. We 
shall now compute this arithmetic genus for primal central curve C. For this we use the 
formula for the Hilbert series of the central sheet due to Terao, in Theorem 1.2 on page 551 
of 31 . See the recent work of Berget [6] for a nice proof of a more general statement. 

As seen in the proof of Proposition 12, the Hilbert series of the coordinate ring of the 
central sheet equals 

ho + hiz + h2z'^ 
(1^ 



hdz" 



\d+2 
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The central curve C is obtained from the central sheet by intersection with a general linear 
subspace of dimension n — d. The (projective closure of the) central sheet is arithmetically 
Cohen-Macaulay since it has a flat degeneration to a shellable simplicial complex, as shown 



by Proudfoot and Speyer 24 . We conclude that the Hilbert series of the central curve C is 

d 



ho + hiz + /i2-2 H h hdz"- 



d 



J2 iJ2^.)-m+J2a-j)h, 

j=0 



+ 0{z 



d-l■^ 



m>d '- i=0 

The parenthesized expression is the Hilbert polynomial of the projective curve C. The degree 



of C is the coefficient of m, and using (13) we again find this to be the Mobius number: 

d 

degree(C) = \fi{A,c)\ = 

1=0 

The arithmetic genus of the curve C is one minus the constant term of its Hilbert polynomial: 

d 

(29) genus(C) = l-5^(l-j>,. 

j=0 



We now see that our assertion (26) follows directly from the generalized Pliicker formula (28) 



For fixed d and n, the degree and genus of C are maximal when the matrix A is generic. In 



this case, hi equals the right hand side of (25), and we need to sum these binomial coefficients 



times two. Hence, our second assertion (27) follows from the identify 

d 

^ z - ( ^ : " " ) = (n-d-1 



d + i 
i 



n — 1 
d-1 



This completes the proof of Theorem 20 



□ 



Example 22. Let d = n — 2 and suppose A is generic. The primal central curve C is a plane 
curve. Our h-vector equals (1, 1, 1). The proof of Theorem 20 reveals that the degree of 
Cisd+l = n — 1 and the genus of C is (2). The Gauss curve 7(C) is the projective line 
P^, but regarded with multiplicity deg(7(C)) = {d + l)d. This number is the degree of the 
projectively dual curve C^. The identity (28) specializes to the Pliicker formula for plane 



curves, which expresses the degree of in terms of the degree and the singularities of C. o 
We close this section by showing how to compute the Gauss curve for a non-planar instance. 



Example 23. Let n = 5,d = 2 and A 



11111 



. The primal central curve has 



12 3 4 

degree 6 and its equations are obtained by clearing denominators in the 4x4- minors of 

/ 1 1 1 1 1 \ 

1 2 3 4 



Cl 

-91) 



C2 

-2zi + Z2- 92) 



^1 



C3 

2z2 + Z3- gs) 



C4 

[Z2 - 2Z3 - 5(4) 



^3 



C5 
-95) 



V 



The Cj and gj are random constants representing the cost function and right hand side of ([T]). 
To be precise, the vector g = {gi, g2, 93, 94., 95)^ satisfies Ag = b as in Sections [g] and [7| The 
linear forms in z = [zi, Z2, z^) come from the change of coordinates B^z — x = g where B is 
a 3 X 5 matrix whose rows span the kernel of A. Writing I for the ideal of these polynomials. 



THE CENTRAL CURVE IN LINEAR PROGRAMMING 



19 



the following one-line command in the computer algebra system Macaulay2 15 computes 
the defining polynomial of the Gauss curve in P^: 

eliminate ({zl ,z2 ,z3} , I+minors (1 ,matrix{{u, v,w}-}*dif f (matrix{-[zl} ,{z2} ,-[z3}-}- ,gens I) ) ) 

The output is a homogeneous polynomial of degree 16 in the coordinates u, v, w on P^. Note 
that deg(7(C)) = 16 is consistent with Theorem 20 because h = {ho, hi, = (1, 2, 3). 



o 



6. The Primal-Dual Formulation and the Average Total Curvature 

This short section offers a dictionary that translates between the primal and the dual 
central curve. We begin by offering an algebraic representation of the primal-dual central 
curve that is more symmetric than that given in the Introduction. This will allow us to take 
our main results so far, previously only stated for the primal LP, and show that they hold 
verbatim also for the dual LP. As primary application, we then derive a matroid-theoretic 



refinement of the Main Theorem in 11 on the average total curvature of the central path. 



Let Ca denote the row space of the matrix A and £^ its orthogonal complement in M". 
We also fix a vector g G M" such that Ag = b. By eliminating y from the system ^ 
in Theorem [l| we see that the primal-dual central path (x*(A), s*(A)) has the following 
symmetric description: 



(30) X G Cj^ + g , s G i-A + c ana XiSi = X2S2 

The implicit {i.e. A-free) representation of the primal-dual central curve is simply obtained 
by erasing the very last equality "= A" in (30). Its prime ideal is generated by the quadrics 
Sj and the affine-linear equations defining Cj^ + g in x-space and Ca -I- c in s-space. 



s G Ca + c and 



A. 



The symmetric description of the central path in (30) lets us write down the statements 



from Section |4] for the dual version. For example, we derive equations for the dual central 
curve in s-space or y-space as follows. Let B be any {n — d) x n matrix whose rows span 
the kernel of A. In symbols, Cb = C-^. The (dual) central curve in s-space is obtained by 
intersecting the (i- dimensional affine space Ca + c = { s G : -Bs = Be } with the central 
in (17). To obtain the central curve in y-space, we substitute Sj = ^^=1 djiVj —Ci 

and 



sheet 



in the equations defining C^^ . This gives dual formulations of Theorems 



13 



20 



Corollary 24. The degree of the dual central curve of equals the Mobius number \n{B, g)| 
and is hence at most (^Zl) ■ The prime ideal of polynomials that vanish on the central path 
is generated by the circuit polynomials (18), but now associated with the space generated by 
the rows of B and the vector g, and the n — d linear equations in s given by Bs = Be. 

Corollary 25. The degree deg(7(C)) of the Gauss image of the dual central curve C is at 
most2-^^i-hi, whereh = h{Br{MB,g)). This implies the bound deg(7(C)) < 2-{d—l)-(^~^). 



Remark 26. Theorem 20 and Corollary 25 give a strengthening of Theorem 1.1 of |11|. 



Megiddo and Shub |21] proved the lower bound {d — l)vr/2 for the total curvature of the 
central path of a 0? variable LP with d+1 constraints. For such instances, our upper bound 
is tight up to a constant. Our main contribution is that we adjust the upper bound to the 
specific matroid of the constraint matrix A. o 



Dedieu, Malajovich and Shub 11 investigated the average curvature of the central paths 



over all the bounded regions of the associated hyperplane arrangement for the linear program 
and show this to be at most 27rd when A is generic. Combining (11), Proposition 18 



Theorem 20 and their dual versions seen in this section, we obtain the following statement: 
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Theorem 27. The average total curvature of the primal central path over the bounded regions 
of the hyperplane arrangement {xi = 0}jg[„] in the affine space £^ + g is at most 

(31) 27r ■ ^to^-^^(Ac) 

^ ^ IMA) I ■ 

If A is a generic d x n matrix, then this average total curvature is at most 27i{n — c? — 1). 

Similarly, the average total curvature of the dual central path over the bounded regions of 
the hyperplane arrangement {si = 0}jg[„] in the affine space Ca + c is at most 

(32) 2.._Eto^-M5,g) 



HB)\ 

In the generic case, this average total curvature is bounded above by 2TT{d — 1). 



Note that the h- vector entries in the numerator of (31 ) refer to the rank d + 1 matroid Ma 



while the Mobius number in the denominator refers to the rank d matroid Ma- Likewise, 



the h- vector entries in the numerator of (32) refer to the rank n — d + 1 matroid M^^g while 



the Mobius number in the denominator refers to the rank n — d matroid Mb- 

Example 28. We consider two examples to demonstrate the finer bounds provided by 



Theorem 27 First, recall the primal LP formulation of the Klee-Minty Cube in Example [21] 
We had already seen that our formulas predict the total curvature is bounded by 327r. We 
can calculate |/i(A)| for the 3x6 defining matrix. The matroid in question is not uniform (it 
only has 14 bases). We recover //(A) through the computation and evaluation of the Tutte 
polynomial as shown in 16 In this case we obtain |/i(A)| = 5 thus Formula 31 says the 
average curvature is no more than This is a better bound than the generic case formula. 

Second, recall the DTZ snake of Example [3] and Figure [2] which is given in dual LP 
form. The curve has degree four and its projective closure C is smooth in P^. So, we have 



deg(7(C)) = 12, and Proposition 18 gives an upper bound 127r on the total curvature of the 
entire central curve in M?. The Mobius number |/i(-B)| is in this case 10, thus we get from 
formula (32) that the average curvature is 1.27r. Once more, our bound is better than the 
generic case prediction. It is interesting to see the large discrepancy between the curvature 
within a single feasible region (calculated in Example [3]) and the average total curvature, o 

7. Global Geometry of the Central Curve 

In this section we return to the central path in its primal-dual formulation, and we study 
several geometric properties. We shall show how the central curve connects the vertices of 
the hyperplane arrangement with the analytic centers of its bounded regions. This picture 
behaves well under duality, as the vertices of the two arrangements are in natural bijection. 



For an algebraic geometer, it is natural to replace each of the affine spaces in (30) by 
a complex projective space and to study the closure C of the central curve in x P". 
Algebraically, we use homogeneous coordinates [xq : Xi : ■ ■ ■ ■- x„] and [sq : si : ■ ■ ■ : 
Writing x and s for the corresponding column vectors of length + 1, we represent 

C\ + g by {xGP": (-b,A)-x = 0} and Ca + c by {s G P" : (-5c, B) ■ s = 0}. 

The projective primal-dual central curve C is an irreducible curve in P" x P". Its bi- 
homogeneous prime ideal in /^[xo, Xi, . . . , x„, Sq, Si, . . . , s„] can be computed by the process 
of saturation. Namely, we compute it as the saturation with respect to (xqSo) of the ideal 
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generated by the above linear forms together with the bi-homogeneous forms 
The resulting ideal is irreducible because the vectors c and g are generic. 

Example 29. Let d = 2,n = 4. Fix 2x4-matrices A = (aij) and B = [bij) such that 
Cb = C^- We start with the ideal J in K[xo, . . . ,X4, sq, . . . , S4] generated by 

aii(xi - gixo) + ai2(x2 - 92X0) + ai3{xs - g^xo) + au{x4 - QaXq), 
021 - QiXq) + a22(a;2 - 92X0) + 023(3:3 - Q^iX^) + 024(^4 - QaXq), 

hll{si - CiSo) + fel2(s2 - C2S0) + 613(53 - C3S0) + 614(54 - C4S0), 
621(^1 - QiSq) + 622(^2 - C2S0) + 623(53 - C3S0) + 624(54 - C4S0), 
SiXi — S2X2, 52X2 — 53X3, S3X3 — S4X4. 

The central curve C is irreducible in x P^. It has degree (3,3) unless A is very special. 
The prime ideal of C is computed as the saturation (J : (xoSo)°°). We find that this ideal 
has two minimal generators in addition to the seven above. These are cubic polynomials in 
X and in s, which define the primal and dual central curves. They are shown in Figure |4j o 

Returning to the general case, we note that, since our curve C lives in P" x P", its degree 
is now a pair of integers {dy^,ds). These two integers can be defined geometrically: d^^ is 
the number of solutions of a general equation Yl^=o(^i^i = on the curve C, and dg is the 
number of solutions of a general equation Yll=o f^^^^ = on C 

Corollary 30. Let c and g be generic vectors in M" and let [d^, dg) be the degree of the 
projective primal-dual central curve C C P" x P". This degree is given by our two Mobius 



numbers, namely rfx = |/i(v4,c)| and dg = |/i(-B,g)|. These numbers are defined in (15) 



Proof. The projection from the primal-dual central curve onto its image in either x-space or 
s-space is birational. For instance, if x is a general point on the primal central curve then 
the corresponding point s is uniquely obtained by solving the linear equations X j 5 j — X j s j on 
Ca + c. Likewise, given a general point s on the dual central curve we can recover the unique 
X such that (x, s) G C. This implies that the intersections in P" x P" that define and dg 
are equivalent to intersecting the primal or dual central curve with a general hyperplane in 
P", and the number of points on that intersection is the respective Mobius number. □ 

Next we discuss the geometry of this correspondence between the primal and dual curves 
at their special points, namely vertices and analytic centers of the relevant hyperplane ar- 
rangements. These special points are given by intersecting the primal-dual curve C with 
certain bilinear equations. The sum of our two Mobius numbers, + dg, is the number of 
solutions of a general bilinear equation 'jijXiSj = on the primal-dual central curve C. 
Two special choices of such bilinear equations are of particular interest, namely, the bilinear 
equation xqSq = and the bilinear equation XjSj = for any i > 1. Note that the choice of 
the index i does not matter for the second equation b Si holds on the curve. 

Let us first observe what happens in P" x P" when the parameter A becomes 0. The 
corresponding points on the primal-dual curve C are found by solving the equation xiSi = 
on C. Its points are the solutions of the n equations xiSi = X2S2 = ■ ■ ■ = = on the n- 
dimensional subvariety (C-^ + g) x (Ca + c) of P" x P". This intersection now contains many 
points in the product of affine spaces, away from the hyperplanes {xq = 0} and {sq = 0}. 
We find the points by solving the hnear equations Xj^ = ■ ■ ■ = = on C\ + g and 
Sji = ■ ■ ■ = Sj^_^ = on Ca + c, where {ii, . . . ,id} runs over all bases of the matroid M{Ca) 




and {ji, . . . ,jn-d} is the complementary basis of the dual matroid M(£^)* = M{Cb)- These 
points represent vertices in the hyperplane arrangements H and H*, where 

"H denotes {xi = 0}ie[n] in + g ^-nd "H* denotes {sj = 0}jg[„] in Ca + c. 

The vertices come in pairs corresponding to complementary bases, so the points with param- 
eter A = on the primal-dual central curve C are the pairs (x, s) where x is a vertex in the 
hyperplane arrangement H and s is the complementary vertex in the dual arrangement "H*. 

Imposing the equation XqSq = means setting A = oo in the parametric representation 
of the central curve. The points thus obtained have the following geometric interpretation 
in terms of bounded regions of the hyperplane arrangements H and "H*. We recall that the 
analytic center of the polytope P = {Ax = b, x > 0} is the unique point in the interior of 
P that maximizes the concave function Yl^=i^'^s{^i)- The algebraic characterization of the 
analytic center is that the gradient of Y17=i ^^si^i)^ which is x~^, is orthogonal to the affine 
space + g = {A'x = b}. This means that the vector x~^ lies in the row span Ca of A. 
Let denote the the coordinate- wise reciprocal of Ca, regarded as a subvariety in P*^. 

Proposition 31. The intersection C^^ fl (C^ + g) defines a zero- dimensional reduced 
suhscheme of the affine space P"\{xo = 0}. All its points are defined over M. They are the 
analytic centers of the polytopes that form the hounded regions of the arrangement H. 

Proof. The analytic center of each bounded region is a point in the variety C^^ fl (£^+g), by 
the gradient argument in the paragraph above. This gives us |/x(y4)| real points of intersection 



on n (C^ + g) • By replacing C^\ with C^^ in Proposition 12 , we know that the degree 
of is This shows that these real points are all the intersection points (over C) and 



they occur with multiplicity one. This argument closely follows the proof of Lemma 11 □ 



Naturally, the dual statement holds verbatim, and we shall now state it explicitly. 
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Proposition 32. The intersection {C-^)~^ fl {Ca + c) defines a zero- dimensional reduced 
subscheme of the affine space P"\{so = 0}. All its points are defined over M. They are the 
analytic centers of the polytopes that form the bounded regions of the dual arrangement H* . 

We summarize our discussion witli tlie following tlieorem on the global geometry of the 
primal-dual central curve. Figure |4] serves as an illustration for the case n = 4 and d = 2. 

Theorem 33. The primal central curve in ^-space passes through each vertex of the ar- 
rangement % as the dual central curve in s-space passes through the corresponding vertex of 
Ti* . As the primal curve passes through the analytic center of each bounded region in %, the 
dual curve reaches the hyperplane {sq = 0}. Similarly, as the dual curve reaches the analytic 
center of each bounded region in W , the primal curve meets the hyperplane {xq = 0}. 



Remark 34. The picture painted by Theorem 33 resembles the unpublished results of Adler 



and Haimovich [l|[8, 17 on co- optimal paths. They considered an LP with two objective 
functions c and c', and they studied the parametric objective c + pc' . As p runs from — oo to 
cxD, the optimal solutions form a path of edges in the arrangement l-L. Adler and Haimovich 
showed that the average length (in edges) of this co-optimal path in a cell of conditional 
on the path being nonempty in that cell, is at most 0(min((i, n — d)). This bound is close to 



the curvature bounds in 11 . It would be interesting to explore possible connections between 
co-optimal edge paths and the degenerations of central curves constructed in Theorem [161 o 



The primal central curve misses precisely one of the antipodal pairs of unbounded regions 
of "H. It corresponds to the region in the induced arrangement at infinity that contains the 
point representing the cost function c. For a visualization see the picture of the central curve 
in Figure^ Here a projective transformation of moves the line from infinity into M^. 



The points described in Propositions |3T] and [32] are precisely those points on the primal- 
dual central curve C for which the parameter A becomes oo. Equivalently, in its embedding 
in P" X P", these are solutions of the equation xqSo = on the curve C. Note, however, that 
for special choices of A, the projective curve C will pass though points with xq = sq = 0. 
Such points, which lie on the hyperplanes at infinity in both projective spaces, are entirely 
independent of the choice of c and g. Indeed, they are the solutions of the equations 

(33) s G Ca = ker _B , x G C\ = kei A, and XiSi = X2S2 = ■ ■ ■ = XnSn = 0. 

The solutions to these equations form the disjoint support variety in P"~i x P""-*^, which 
contains pairs of vectors in the two spaces Ca and C^ whose respective supports are disjoint. 

Example 35. Figure [5] shows the primal-dual central curve for the matrices 

, A -1 \ J „ A 1 1 
^=^0 11 -1) ^=^0011 

The disjoint support variety consists of the point (x, s) = ([0 : : 1 : 1], [1 : —1 : : 0]) in 
P3 X p3. After we identify P^ x P^ with {(x, s) G P^ x P^ : xq = 0, sq = 0}, this point lies 
on the primal-dual central curve and appears as point "c" in Figure [5| o 

When studying the global geometry of the primal-dual central curve, it is useful to start 
with the case when the constraint matrix A is generic. In that case, our matroids are uniform. 



namely M{Ca) = Ud,n and M{Cb) = Un-d,ni and the disjoint support variety (33) is empty. 



This condition ensures that the intersections of the curve C with both the hypersurfaces 
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Figure 5. A primal-dual curve that intersects its disjoint support variety. 

{xqSq = 0} and {xiSi = 0} in P" x P" is reduced, zero-dimensional and fully real. The 
number of points in these intersections is the common number of bases in the two matroids: 

The intersection points of C with {xqSo = 0} are the pairs (x, s) where either x is an analytic 
center in and s lies at infinity in the dual central curve, or x lies at infinity in the primal 
central curve and s is an analytic center in T-t*. The intersection points of C with {xiSi = 0} 
are the pairs (x, s) where x is a vertex in 7i and s is a vertex in Ti*. Figure |4] visualizes 
the above correspondences for the case n = 4 and d = 2. If we now degenerate the generic 
matrix A into a more special matrix, then some of the above points representing vertices and 



analytic centers degenerate to points on the disjoint support variety (33), as in Example 35 
In Theorem [33] we did not mention the degree of the primal or dual central curve. For the 
sake of completeness, here is a brief discussion of the geometric meaning of the degree d^'- 

Remark 36. Consider the intersection of the primal central path with a level set {c^x = cq} 
of the objective function c. Varying cq produces a family of parallel hyperplanes. Each hyper- 
plane meets the curve in precisely d^ points, all of which have real coordinates. These points 
are the analytic centers of the (ra— ci— l)-dimensional polytopes obtained as the bounded re- 
gions of the induced arrangement of n hyperplanes {xi = 0} in the affine space {x G : 
Ax = b, c-^x = Co}. We can see d^ as the number of {n — d — l)-dimensional bounded 
regions in the restriction of the arrangement H to a general level hyperplane {c"^x = Cq}. 
In particular, this gives a one-dimensional family of hyperplanes all of whose intersection 
points with the central curve are real, as suggested by the left diagram in Figure [3j o 

A main theme in this paper was that projective algebraic geometry provides an alternative 
view on optimality and duality in optimization, as well as powerful tools for analyzing interior 



point methods. This parallels the discussion of semidefinite programming in 27 . See also 130 



for a statistical perspective on analytic centers and central curves in the semidefinite context. 
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